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CHARACTERISING ANALYTES 

IMS invention telates to meflKKis of laWUing analyte molecules, paxticularly non-volatile 
biomolecules. ™th markers that enhance fte sensitivity with which their associated 
analyte biomolecule is detectable by MalriK Assisted Laser Desorption lonisatum 
(MALDO Mass Spectrometry. Spedflcally tWs tovention relal»s t» the ability of certam 
dyes to improve the sensitivity of detection by MALDI mass spectrometry of analytes 
libelled with these dyes. 

H,e identification of proteins in biological samples is an essential activity ofbiochemical 
analysis, particularly tiie determination of the sequence of a protein, since the sequence 
determines tixe slxucture of a protein, which, in turn, determines the function of the 
protem Traditional techniques for protein identification are cumbersome and relatively ^ 
slow The mainstay of protein identification techniques has been chemical sequencing of 

peptides using the Edman degradation, which can sequentially identify amino adds in a 
peptide from tiieN-terminus. This sequencing technique is lypically used in conjunction, 
^th emymatic digestion of a protein or polypeptide. Typically, an unidentified 
polypeptide is digested and its component peptides are separated from each other by 
chromatography. H.e individual peptides are liien subjected to Edman degradation. THe 
sequences of tiie peptides can be ordeared by comparing the sequences of peptides from 
digestion of the polypeptide witix different sequence specific cleavage reagents. This 
process allows the conq,lete sequence of a polypeptide to be determined. While tins has 
been a highly successfid technique for the identification of proteins, it is quite laborious. 
New technologies have made rapid protein identification more feasible such as Matrix 
Assisted Laser Desorption lonisation Time-of-Flight (MALDI-TOF) mass spectrometiy, 
which has permitted tiie development of peptide mass fingerprinting as a relatively rapid 
procedure for protein identification. 

A typical peptide mass fingerprinting protocol involves determining the mass of tiie 
unidentified protein followed by digestion of liie protein (in gel or in solution) witii 
enzymes such as trypsin. Trypsin cleaves polypeptides selectively at arginine and lysme 
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residues, leaving either arginine or lysine at the C-tennini of the product peptides. The 
positions of lysine and arginine in the sequence of a polypeptide detemiine where the 
polypeptide is cut giving rise to a characteristic series of peptides. The pattern of peptides 
can be easily detected by MALDI-TOF mass spectrometry. This mass specfrometric 
technique has a large mass range, can readily ionise large biomolecules, will 
preferentially- produce -singly charged ions aiid-cbmpetitibn for ionisation wifli this 
technique is not severe, although competition can be problematic. This means that there 
is generaUy one peak in the mass spectrum for each peptide, the mass-to-charge ratio for 
each peak has essentially the same value as the mass of the peptide, with an added proton 
to ionise the peptide, and most (and sometimes all) the peptides from the tryptic digest of 
an unidentified protein can be analysed simultaneously. In effect the mass spectrum is a 
*bar-code' in which the lines in the spectrum represent the masses of the characteristic 
cleavage peptides of the protein. For any given protein, there may be some peptides, 
which have the same mass as a peptide from another protein but it is very unlikely that 
two different proteins wiU give rise to peptides that all have identical masses. This means 
that the pattern of masses of the tiryptic digest of a protein is a fairly unique identifier of 
that protein and is called a Peptide Mass Fingerprint (PMF). The relative uniqueness of 
PMFs means that databases of calculated (or theoretical) PMFs, determined from known 
protein sequences or sequences that have been predicted from genomic DNA or expressed 
sequence tags (ESTs), can be used to identify proteins in biological samples (TPappin DJC, 
Hdjrup P and Bleasby AJ, Current Biology 3: 327-332, '«Rapid identification of jwoteins 
by peptide-mass fingerprinting." 1993; Mann M, Hojrup P, Roepstorff P, Biol Mass 
Specti-om 22(6): 338-345, •'Use of mass spectrometric molecular weight information to 
identify proteins in sequence databases." 1993; Yates JR 3rd, Speicher S, Griffin PR, 
Hunkapiller T, Anal Biochem 214(2): 397-408, "Peptide mass maps: a highly informative 
approach to protein identification." 1993). The PMF for an unidentified protem can be 
compared with all of the PMFs in a database to find the best match, tiiereby identifying 
the protein. Searches of this kind can be constrained by determining the mass of the 
protein prior to digestion. In this way the pattern of masses of an unidentified 
polypeptide can be related to its sequence, which in tiim can help to determine the role of 
a protein in a particular saiiple. 
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There are, however, many technical difficulties involved in determining tiie PMF for a 
protein. A typical protein will give rise to twenty to liiirty peptides after cleavage with 
trypsin, but not all of these peptides will appear in iJie mass spectrum. The precise 
reason^ for this are not fully understood. One fector lhat is beUeved to cause incomplete 
spectra is competition for protonation during the ionisation process, resulting in 
preferential ionisation of arginine containing peptides (Krause E. & Wenschuh H. & 
Jungblut P.R.. Anal Chem 71(19): 4160-4165. "The dominance of argmine-containing 
peptides in MALDI-derived tryptic mass fingerprints of proteins." 1999). In addition, 
there are surface effects that result from the process of preparing MALDI targets. The 
targets are prepared by dissolving the peptide digest in a solution of the matrix material. 
Small droplets of the peptide/matrix solution are dropped onto a metal target and left to 
dry. Diflferences in solubility of peptides will mean that some peptides will preferentiaUy 
crystallise near the top surface of the matrix where they will be desorbed more readily. 

Sensitivity is also a problem with conventional protocols for identifying proteins from 
their PMF. To be an effective tool, it should be possible to determine a PMF for as small 
a sample of protein as possible to improve the sensitivity of Ihe analysis of protein 



Some attempts have been made to inwove the ionisation of peptides that do not contain 
arginine by chemical derivatisation of peptides. Conversion of lysine to homo-arginine is 
one approach lhat has met with some success (V. Bonetto et al.. Journal of Protein 
Chemistry 16(5): 371 - 374, "C-termmal Sequence Determination of Modified Peptides 
by MALDI MS", 1997; Francesco L. Brancia, Stephen G. OHver and Simon J. Gaskell. 
Rapid Commun. in Mass Spec, 14, 2070-2073, "Improved matrix-assisted laser 
desorption/ionisation mass speclrometric analysis of tryptic hydrolysates of proteins 
following guanidination of lysine-containing peptides." 2000; Brancia et al.. 
Electrophoresis 22: 552 - 559, "A combination of chemical derivatisation and improved 
bioinformatics tools optimises protein identification for proteomics", 2001). The 
conversion of lysine to homo-arginine introduces guanidino fimctionalities into all of tiie 
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peptides from a tryptic digest, with the exception of C-tenninal peptides, greatly 
iinproving the r^esentation of lysine cQntaining peptides in the MALDI-TOF mass 
spectra. 

The derivatisation of peptides to introduce guanidino-groups is a method of improving 
the proton affinity of the derivatised peptide. This approach t6 improve sensitivily has 
been moderately successful in enhancing the sensitivity of detection in techniques that 
depend on protonation to achieve ionisation such as electrospray ionisation (ESI) and 
MALDI analysis. These techniques are most effective if the analyte does not afready 
possess a functionality with a high proton affinity, e.g. oligosaccharides (Okamoto et al.. 
Anal Chem. 69(15): 2919-2926, 'High-sensitivity detection and post-source decay of 2- 
aminopyiidine-derivatized oligosaccharides with matrix-assisted laser 
desorption/ionisation mass spectrometry." 1997). However, analytes that already contain 
functionalities that are readily protonated, such as trypsin digested peptides, do not 
benefit significantly from such reagents and other methods to improve the sensitivity of 
these sorts of analyte are needed. 

Various other reagents for derivatising peptides have also been developed. Reagents that 
introduce quatemaiy ammonium functionalities and quaternary phosphonium 
functionalities have been developed for positive ion mass spectrometry. Halogenated 
compounds, particularly halogenated aromatic compounds are well known electrqphotes, 
i.e. they pick up thermal electrons very easily A variety of derivatisation reagents based 
on fluorinated aromatic conqwunds (Bian N. et al.. Rapid Commun Mass Spectrom 
11(16): 1781-1784, 'TDetection via laser desorption and mass spectrometry of multiplex 
elecliophore-labelled albumin." 1997) have been developed for electron capture 
detection, which is a highly sensitive ionisation and detection process that can be used 
witii negative ion mass spectrometiy (Abdel-Baky S. & Giese R.W., Anal Chem 
63(24):2986-2989, "Gas chromatography/electron capture negative-ion mass 
specfrometry at the zeptomole level." 1991). A fluorinated aromatic group could also be 
used as a sensitivity-enhancing groiqj. Aromatic sulphonic adds have also been used for 
impro^dng sensiti^dty in negative ion mass spectrometry. 
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Each type of derivatisation reagent that has been disclosed in the prior art has different 
benefits and limitations, which depend on the method of ionisation used and on the 
methods of mass analysis used (for a review see Roth et al., Mass Spectrometry Reviews 
17:255-274, "CSiarge derivatisation of peptides for analysis by mass spectrometry", 
1998). The mechanism by which sensitivity is enhanced may also be different for each 
type of group. Some derivatisation methods increase basicity and thus promote 
protonation and charge localisation, which inproves sensitivity in surface desorption 
techniques like Matrix Assisted Laser Desorption Ionisation (MALDI) and Fast Atom 
Bombardment (FAB). So far, reagents that introduce charge-carrying functionalities or 
functionalities with high proton affinities have been developed for MALDI mass 
spectrometry but no reagents that improve the desorption of the analyte by increasing its 
ability to absorb light and to mix with the matrix have been reported. 

Negative ion mass spectrometry is sometimes more swisitive because there is less 
background noise. A tag that can enhance both negative ion mode detection and positive ion 
mode detection would have significant advantages. A tag for unifomaly improving 
sensitivity of all associated analytes has yet to be found for all mass spectrometry 
techniques and it is unlikely that a universal reagent will be found. However, for specific 
mass spectrometric techniques it should be possible to design reagents that take advantage 
of features of a particular technique to promote detection sensitivity. 

In this invention, derivatisation reagents and methods for their use have been developed 
to enhance the sensitivity of detection by MALDI mass spectrometry and it is anticipated 
that the reagents should be effective for most analytes that may be detected by this 
technique. This technique even enhances detection of peptides that -contain easily 
protonated functionalities such as amino and guanidino groups. The inventors have 
observed that coupling a MALDI matrix dye direcfly to a biomolecule such as a peptide 
prior to embedding that peptide in the free matrix will greatly enhance the sensitivity with 
which the peptide can be detected. 
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WO98/31830 describes arrays of cleavable labels that are detectable by mass 
spectrometry which identify the sequence of a covalently linked nucleic acid probe. These 
mass labels have a number of advantages over other methods of analysing nucleic acids. 
At present commercially favoured systems are based on fluorescent labelling of DNA. 
Fluorescent labelling schemes permit the labelling of a relatively small number of 
molecules simultaneously, typically 4 labels .can be used simultaneously and -possibly up 
to eight However tiie costs of the detection apparatus and the difficulties of analysing the 
resultant signals limit the number of labels that can be used simultaneously in a 
fluorescence detection scheme. An advantage of using mass labels is the possibility of 
generating large numbers of labels which have discrete peaks in a mass spectrum 
allowing similar nxmibers of distinct molecular species to be labelled simultaneously. 
Fluorescent dyes are expensive to synthesize whereas mass labels can comprise relatively 
siniple polymers permitting combinatorial synthesis of large numbers of labels at low 
cost This application describes the use of mass-modified MALDI matrix molecules for 
the labelling of biomolecules. Tags comprising MALDI matrix agents such as cinnamic 
and sinapinic acid can be attached to biomolecules through a photocleavable linker 
allowing cleavage and desorption of tags within a laser desoiption ionisation mass 
spectrometer without requiring additional matrix. 

WO99/60007 discloses mass tags comprising trityl functionalities for the labelling of 
nucleic acids and oligonucleotides. These tags can be cleaved fi-om ttieir associated 
oligonucleotides by photolysis in a MALDI-TOF mass spectrometer prior to desorption. 
The cleavage product is charged which is advantageous as it improves the sensitivity of 
the detection of the tags. This method also does not require additional matrix. 

The prior art therefore discloses methods and reagents for cleavable tags for use in 
MALDI mass spectrometry that may be desorbed without additional matrix. The present 
invention is distinguished by the fact that the tags are not cleaved from the analyte and 
that the tags are used in the presence of free matrix material. The absence of free matrix 
material is detrimental to the present iiwention. 
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It is o«ect of this invention to overcome fhe proMen» associated with ihe above pnor 
art fi, particular, it is an aim of this invention to provide methods and labels fl^ can be 

used to produce improved peptide mass fingerprints and which wUl improve &e 
sensitiviV of detection of ofcer labdlcd non-volatUe macromolecules particularly other 
biomolecules such as nucleic aci<te. 

It is a fi-rther object of to invention to provide con^unds vrtrich have desirable features 
as mass labels, and methods for the use of ftose con,»ond8 to provide in,»oved mass 
spectra of associated analytes. 

Accordingly, flie preset invention provides a method for characterising an analyte by 
assisted laser desotption ionisation QAAlSiJ) mass spectrometty. which meflxod 
comprises: 

(a) labelling the analyte with a Ught-absorbing label that absorbs Ught at a. 
pre-determined frequency, to form a labelled analyte; 

(b) embedding the labelled analyt» m a matrix formed from at least one 
compound that absorbs light, to form an embedded labelled analyte; 

(c) desorbing the embedded labelled analyte by exposing it to light having the- 
pre-determined frequency, to form a desorbed analyte; and 

(d) detecting the desorbed analyte by mass spectrometry to characterise the 
analyte. 

Thus the presence of a light-absorbing label attached to the analyt« causes the labelled 
analyte molecule to absorb energy, which in turn aids in energising tHe analyte to bring it 
into the gaseous phase where it can be detected by «ie mass spectrometer. Generally the 
subsl^ce or compound forming the matrix will also absorb the light and be brought mto 
the gas phase at the same time as the analyte. Thus, it is advantageous in the present 
invention, if the light-absorbing label and/or the compound forming the matrix are rapidly 
desorbed on the application of light. 
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The advantage of the methods, (and the compounds and kits) of the present invention is 
that they contribute to improving sensitivity, and can increase the numbear of peptides (e.g. 
smaU peptides, which are not detectable in conventional MALDI ejcperiments) that are 
detected from a protein. In addition, through the use of appropriate tags, it is possiT)le 
with (his invention to analyse multiple samples simultaneously and it is also possible to 
determine the ratios of corresponding peptides in the difF«^t-san5>les. With appropriate 
labelling procedures, it is also possible fecilitate the conditioning of polypeptide sample 
for detection by mass spectrometry. 

In a preferred embodiment, the desorbed analyte is directly detected by mass 
spectrometry. In the context of the present invention, this means that it is the analyte 
itself that is detected by mass spectrometry, rather than some other moiely such as a mass 
label which is only relatable to the analyte. This embodiment is particularly preferred 
when one or more peptide mass fingerprints (PMFs) are being produced to characterise a 
protein, or a population oOf proteins. 

Alternatively the desorbed analyte may be indirectly detected by mass spectrometry. In 
this embodiment the analyte is additionally labelled with a mass label relatable to the 
analyte. The mass label is preferably cleaved from the desorbed analyte before being fed 
into the mass spectrometer. The analyte is characterised by detecting the mass label and 
relating it to the analyte. 

hi the present invention, the Ught employed is not especiaUy limited, provided tiiat it is 
able to sufficiently excite the Kght-absorbing label and thus the analyte. TypicaUy the 
light to which the embedded labelled analyte is exposed is a laser light. The frequency of 
the light is also not especiaUy limited, and UV, visible or infrared hght may be employed. 

It is particularly preferred that the compound forming the matrix absorbs light at the same 
frequency as the light-absorbing label. This is efficient, since a single Ught source can be 
employed in the apparatus. One way of achieving this is to ensure that the matrix and the 
light-absorbing label are formed from same compound. This is a preferrtsd way of 
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practising the present invention. In this respect the jBnal light-absorbing label and matrix- 
forming con^jonnd need not be identical (they cannot be, since one is attached chemically 
to the analyte) but should instead be substantially the same, or should comprise the same 
or similar groups such that they absorb light at flie same, or substantially the same, 
wavelengths. 

It is preferred that the light-absorbing label is formed from a dye, such as a 
non-fluorescCTit dye. The advantage of a non-fluorescent dye is that the energy absorbed 
is retained in the molecule to aid in the dissipation into the gas phase, rather than lost by 
re-emitance as fluorescence. 

In the present invention, the analyte is not especially limited, and the invention may be 
employed to characterise any analyte. However, it is particularly well suited to 
biomolecules, such as proteins, polypeptides, peptides, peptide fragments and amino 
acids. 

The methods of the present invention are useful in producing a PMF from a protein or 
polypeptide. Accordingly, the present invention also provides a method for 
characterising a polypeptide, which method conq)rises the steps of: 

(a) optionally reducing cysteine disulpMde bridges in the polypeptide to form 
free thiols, and capping the free thiols; 

(b) cleaving the polypeptide with a sequence specific cleavage reagent to form 
peptide fragments; 

(c) optionally deactivating the cleavage reagent; 

(d) capping one or more s-amino groups that are present with a lysine reactive 
agent; 

(e) analysing peptide fragmente according to a method as defined above to 
form a mass fingerprint for the polypeptide; and 

(f) determining the identity of the polypeptide from the mass fingerprint. 
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When a population of proteins or polypeptides is to be investigated, the present invention 
can also be employed. In this respect, the present invention also provides a method for 
characterising a population of polypeptides, which method conqjrises the steps o£ 



(a) 



optionaUy reducing cysteine disulphide bridges in one or more 

polypeptides-to form free thiols, and <»pping the free thiols; 

(b) separating one or more polypeptides from the population; 

cleaving one or more polypeptides with a sequence specific cleavage 
reagent to form peptide fragments; 
(d) optionally deactivating the cleavage reagent; 

capping one or more 8-amino groups that are present with a lysine reactive 
agent; 

analysing peptide fragments according to a method as defined above to 
form a mass fingerprint for. one or more polypeptides; and 
(g) determining the identity of one or more polypeptides from the mass 
fingerprint 

The present invention can also be used to compare a number of protein or polypeptide 
samples. Thus, tiie present invention fiirfher provides a method for comparing a plurality 
of san^les, each sample comprising one or more polypeptides, which method comprises 
the steps of: 



(c) 

(d) 
(e) 

(f) 



(a) optionally reducing cysteine disulphide bridges and capping the free thiols 
in one or more polypeptides from the samples; 

(b) separating one or more polypeptides from each of tiie samples; 

(c) cleaving the polypeptides with a sequence specific cleavage reagent to forni 
peptide fragments; 

(d) optionally deactivating the cleavage reageni; 

(e) capping one or more e-amino groups that are present with a lysine reactive 
agen^ 
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(f) analysing peptide fragments according to a method as defined above to 
form a mass fingerprint for one or more polypeptides from the san9>les; and 

(g) determining the identity of one or more polypeptides in the samples from 
one or more mass fingerprints. 

m the latter ftree aspects of ihe present invention it is preferred that the lysine-reactive 
agent is a labelled lysine-reactive agent. The label can be en„loyed to distinguish which 
proteins or polypeptides originate from which samples, so that the samples can be pooled 
for analysis where appropriate. 

m these aspects of the present invention, the sequence specific cleavage agent cleaves the 
one or more polypeptides on the C-terminal side of a lysine residue. Preferably the 
specific cleavage reagent con^ses Lys-C or Trypsin. It is also preferred that the peptide 
fragments having capped s-amino groups are removed by affinity capture. Ibis can be 
achieved by ensuring that the lysine reactive agent conqmses biotin. 

The most preferred lysine reactive agent for use in flie present invention comprises a 
hindered Michael reagent Typically the hindered Michael agent comprises a confound 
having the following structure: 

c=c 

X R 

wherein X is an electron withdrawing group ftat is capable of stabilising a negative 
charge; the R groups independently comprise a hydrogen, a halogen, an alkyl, an aryl, or 
an aromatic group with the proviso that at least one of the R groups comprises a sterically 
hmdering group; and the group r2 comprises a hydrogen, a halogen, a hydrocarbon 
group, an electron withdrawing group and/or a linker capable of attachment to an affinity 
cj^ture fimctionality or a solid phase support 
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The present invention also provides a compound for labelling an analyte» which 
conopound has the following structure: 

D-M-L-A 

, wherein D comprisesLaJight absorbing label, M-comprises a mass modifier, L comprises a 
linker and A con:q>rises an aaiafyte. 

In a preferred embodhnent of the present invention, D comprises a dye, such as a non- 
fluorescent dye. TypicaUy, the non-fluorescent dye comprises a compound selected from 
4-dimethylaminoazobenzine-4'-sulphonyl chloride (DABSYL chloride), 
3-hydroxypicohnic acid, 2,5-dihydroxybenzoic acid and 4-hydroxy-alpha-cyanocinnamic 
acid. M may be any mass marker, and is not especially Imrited. Preferably M is selected 
fi:om a conapound formed firom an aryl ether, and an oligomer formed from 2 or more aryl 
ether units. 

The linker is not especially limited and preferably comprises a group selected from 
-CR2-CH2-SO2-, -N(CR2-CH2-S02-)2, -NH-CR2-CH2-SO2-, -CO-NH-, -C0-0-, 
-NH-CO-NH-, -NH-CS-NH-, -Cm-NH-, -S02-NH-, -NH-CH2-CH2- and 
-OP(=0)(0)0-. 

The analyte, A may be any analyte, as already discussed above in relation to the methods 
of the present invention. Preferably A is a biomolecule, such as a protein, a polypeptide, 
a peptide, a peptide fragment or an amino add. 

The present invention also provides a kit for characterising an analyte by matrix assisted 
laser desorption ionisation (MALDI) mass spectrometry, which kit comprises: 



(a) a light absorbing label havmg a reactive functionality for attaching the label 
to an analyte; and 
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(b) a coinpound for farmmg a matrix, which conq)ouBd absorbs Kght at the 
same frequency as the light-absorbing label. 

The present invention will be described in furflier detail by way of exan^le only with 
reference to the accompanying Figures, in which: 

Figure 1 shows a schematic description of the synthesis of two MALDI matrix mass tags 
based on 4-hydroxy-alpha-cyano-cmnamic acid; 

Figure 2 shows a High Performance Liquid Chromatography (HPLQ ixace from the 
coiq)ling reaction of a reactive tag of this invention with a synthetic peptide; 

Figure 3 shows a MALDI TOF MS spectrum of the peptide mixture shown in Figure 1; 

Figure 4 shows a MALDI TOF MS spectrum from ihe labelling reaction of a synthetic 
peptide with a reactive MALDI matrix mass tag that comprises a linker. 

Figure 5a shows a schematic of the labelling protocol for beta-MSH; 



Figure 5b shows Ihe MALDI TOF mass spectrum of the unlabelled and undigested beta 

MSHf 
Figure 



MSH peptide; 

6 shows spectra MALDI TOF for tiie unlabelled and labelled digests of beta-MSH; 



Figure 7 shows MALDI TOF mass spectra of this peptide mixed in various molar ratios 
with the digest of unlabeled beta-MSH; 

Figure 8 shows a number of MALDI TOF mass spectra for a serial dilution of a stock 
solution of 44 pmol of the labelled peptide comprising amino acids 7 to 11 of beta-MSH 
mixed with 180 pmols of digested and unlabeled beta-MSH - the first spectrum shows the 
MALDI TOF analysis of the stock solution; 
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Figure 9 shows a typical MALDI TOF peptide mass fingerprint mass spectrum for 
iinlabeUedBSA; 

Figure 10 shows the results of a search far immodified BSA peptides in the SWISSPROT 
database using the MS-Fit database search program; 

Figure 11 shows a MALDI TOF peptide mass fingerprint mass spectrum for labelled 
BSA; 



Figure 12 shows the results of a search for modified BSA peptides in the SWISSPROT 
database using the MS-Fit database search program; 

Figure 13 shows a typical MALDI TOF peptide mass fingerprint mass spectrum for 
unlabelled GAPDH; 

Figure 14 shows the results of a search for unmodified GAPDH peptides in the 
SWISSPROT database using the MS-Fit database search program; 

Figure 15 shows a MALDI TOF peptide mass fingerprint mass spectiiim for labelled 
GAPDH; and 



Figure 16 shows the results of a search for modified GAPDH peptides in the 
SWISSPROT database using the MS-Fit database search program. 

The present invention will now be described in more detail. 

In the present invention a label molecule of the following structure is employed for 
attaching the light-absorbing label (or dye) to the analyte of interest 



Dye — Reactive Functionality 
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where the dye « preferably non-fluorescent and preferably dissipates abs«b.d radiation 

thermally. 

m a preferred embodiment the label molecules comprise the following structure: 

Dye - Mass Modifier - Linker - Reactive Functionality 

In either case, the reactive functionality is not especially limited, provided that it is 
suitable for attaching the label to the analyte of interest. 

m a preferred aspect of fliis invention there is provided a meftiod of analysing an analyte 
molecule comprising the steps of: 

1. Labelling the analyte molecule with a dye that absorbs Ught of a 
predetermined frequency. 

2. Embedding tibe labelled analyte molecule in a matrix conqnising a dye that 
absorbs light at the frequency of the laser to be used in the desorption step. 

3 Desorbing the labelled analyte by appHcation of laser Hght of the 
predetermined frequency so as to effect sublimation of the matrix and thus 
also the labelled analyte. 

4. Detecting ions formed during the desorption step by mass spectrometry. 

In a ftirfher aspect of this invention these is provided a kit comprismg: 

1. A mass label molecule usable in Ihe first aspect of this invention; and 

2. A compatible MALDI matrix reagent. 

MALDI matrix dyes , 
Various compounds have been found to be useful as matrices for MALDI analysis of 
large biomolecules. These confounds are generally characterised by a number of 
properties. The compounds generally have a strong extinction coefficient at the 
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frequency of fhe laser used for desorption. The compounds are also able to isolate analyte 
molecules in a solid solution and the compounds are sufficiently voIatUe to rapidly 
sublime when exposed to laser shots in the MALDI mass spectrometer. The subliming 
dye should vaporise r^idly in a jet that entrains fhe embedded analyte molecules and for 
most purposes this should take place without fragmentation of the analyte (aMiou^ 
fragmentation may sometimes be desirable if structural information about flie analyte is 
sought). However, a matrix should not be too volatfle as experiments can sometimes take 
several hours and the analyte/matrix oo-crystal must remain stable under vacuum in the 
ion source for this period of time. The property of volatility to laser irradiation can be 
measured approximately by determining the initial velocity of analyte ions generated by 
the matrix. It has been observed that higher initial velocities correspond to 'softer' 
ionisation, i.e. reduced fragmentation, (Karas M. & Gltickmann M., J Mass Spectrom 34: 
467 - 477, 'The initial ion velocity and its dependence on matrix, analyte and preparation 
method in Ultraviolet Matrix-assisted Laser Desorption/Ionisation", 1999) but hi^ initial 
ion velocities of some matrices also correlates to rapid sublimation under vacuum. 

Different matrices have different properties in terms of their ability to assist in the 
desorption of embedded analytes and in the subsequent sensitivity with which the 
analytes are detected. It has been found empirically that certain matrices are more 
appropriate for the analysis of particular analytes than others. For exanq>le, 
3-hydroxypicolinic acid has been found to be most effective for analysing 
oHgonucleotides (Wu et al.. Rapid Commun. Mass Spectrom 7:142-146, **Matrix-assisted 
laser desorption time-of-flight mass spectrometry of oUgonucleotides using 
S-hydroxypicolinic add as an ultra-violet sensitive matrix'*, 1993), while 
2,5-dihydroxybenzoic acid and 4-hydroxy -alpha-cyano-cinnamic acid (HCCA) are both 
most effective for the analysis of peptides and proteins (Strupat et al.. Int. J. Mass 
Spectrom. Ion Proc. Ill: 89 - 102, "2,5-dihydroxyben2oic acid: a new matrix for laser 
desorption/ionisation mass spectrometry", 1991; Beavis et al.. Org. Mass Spectrom 27: 
156-158, "a-cyano-4-hydroxy cinnamic acid as a matrix for matrix-assisted laser 
desorption mass spectrometry", 1992). Various cinnamic acid derivatives have been 
found, to be effective fox the analysis of proteins (Beavis R.C. & Chait B.T., Rapid 
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Commun Mass Spectrom 3(12): 432-435, "Cirmamic acid derivatives as matrices for 
ultraviolet laser desorption mass spectrometry of proteins." 1989) and the choice of 
matrix is dependant on the nature of the analyte, for example simiapinic acid may be 
preferred over HCCA for large peptides and polypeptides, while HCCA is generally 
preferred for smaller peptides. 2,5-dihydroxybenzoic acid may produce less fragmentation 
than the cinnamic add derivatives in some cases. 

Most of the matrices discussed above have been acidic matrices. Basic matrices have also 
been developed and may be more appropriate for the analysis of acid-sensitive 
compounds (Fitzgerald et al.. Anal Chem 65(22): 3204-3211, "Basic matrices for the 
matrix-assisted laser desoiption/ionisation mass spectrometry of proteins and 
oligonucleotides." 1993). 

Infrared MALDI (IR-MALDI) is similar in principal to ultraviolet MALDI (UV-MALDI) 
in that analytes must be embedded in a matrix that preferably has a strong extinction, 
coefficient at the frequency of tiie laser in tiie desorption instrument Appropriate^ 
matrices tend to be different compounds from those used in UV-MALDI and Kquid: 
matrices are often used. Glycerol, urea, ice and succinic acid have all been shown to be' 
effective matrices for IR-MALDI (Tafrose et al.. Rapid Commun Mass Spectrom 13(21):' 
2191-2198, •*Insight into absorption of radiation/energy transfer in infrared matrix- 
assisted laser desorption/ionisatian: the roles of matrices, water and metal substrates." 
1999). However, some UV-MALDI matiices, such as cinnamic add derivatives, also 
work as IR matrices (Niu et al., J Am. Soc. Mass Spectrom. 9:1 - 7, "Dfrect cowpfoisoa 
of infrared and ultiuviolet wavelength matrix-assisted laser desoiption/ionisation mass 
spectrometry of proteins", 1998). 

Liquid matiices for UV-MALDI have also been explored (Ring S. & Rudich Y., Rapid 
Commun Mass Spectrom 14(6): 515-519, "A con^arative study of a liquid and a solid 
matrix in matiix-assisted laser desorption/ionisation time-of-flight mass spectrometry and 
collision cross section measurements.*' 2000; Sze et al., J Am Soc Mass Spectrom 9(2): 
166-174, **Foimulation of matrix solutions for use in matrix-assisted laser 
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desoiption/ionisation of biomolecules." 1998; Karas et al Mass Spectrom Rev 10: 335, 
1991). The simplest examples of liquid matrices comprise solutions of the matrices used 
as solids. True liquid matrices are also kno\vn such as nitrobenzoyl alcohol. Both types 
of matrix have some advantages in terms of sample consistency, stability under vacuum 
and ease of handling, however solid matrices tend to be more sensitive. In the context of 
the present invention, the improvements in SCTisitlvity ma;y justify the lise of liquid 
matrices. This may have particular advantages in the automation of sample preparation, 
as liquid handling robotics are widely available and the use of solutions of matrices, for 
solid matrix co-crystallisation, which readily clog dispensing devices can be avoided. 

Reactive tags comprising dyes and MALDI matrix dyes 

In the one aspect of this invention light-absorbing dye molecules attached to analytes are 
provided. Various dyes that are not conventionally used in MALDI mass spectrometry 
may be used with this invention. Some dyes that absorb strongly in UV frequencies are 
commercially available with reactive functionalites, e.g. 4-dimethylaminoa2obenzene-4'- 
sulfonyl chloride (DABSYL Chloride, Sigma-Aldrich, Poole, Dorset, UK). This reagent 
and similar UV absorbing dyes that thermally dissipate luminal excitation are applicable 
with this invention. 

It is also possible to prepare reactive dyes from conranercially available intermediates. A 
number of acidic matrices that are widely used for MALDI mass spectrometry, such as 
cinnamic, nicotinic and hydroxybenzoic acid derivatives, are commercially available. 
The acidic fimctionalily in most of fliese reagents is a carboxylic acid group. This 
fimctionalily may be readily converted to an active ester or acid chloride by conventional 
chemical methods (see for example Solomons, "Organic Chemistry", Fifth Edition 
published by Wiley). Preferred active esters include N-hydroxysuccinimide (NHS) esters 
and pentafluorophenyl esters. A variety of dyes will function in the methods of this 
invention, but preferred compounds absorb strongly in the frequency that is used for laser 
desoiption in the mass spectrometer. Typically Ultra-Violet (UV) radiation is used. 
Infra-red lasers are also used for some MALDI applications. 
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Cir^c add derivatives are preferred dye. tha, are widely used In W-MALDl TOF 
(Beavls RC. Chait FT, Rapid C«nmun Mass Spectrom 3(12):432^35. "Chmannc ae.d 
derivatives as matrices for ultraviolet laser deso.ptiou hmss spectrometry of protems. 
1989) A reactive derivative of dmumric acid is discussed in the exarr^les below. It.s 
anticipated that this reaserrt may be appUcable to both UV- and IR-MALDL 

Reactive Functionalities 

A variety of other reactive ibnctionalities may be appropriate to ,«q«re reactive dyes ibr 
,Be with this invention. Table 1 below lists some reactive ftnctionaBties tot may be 
incoTorated into a dye molecule. These reactive functionalities may be reacted wrto 
nucleopWlic ibnctionalities which are found in biomolecules, particularly in peptides and 
polypeptides. Reaction of to, reactive functionalities wifl. to nucleophilic toctionaht.es 
drown generates a covalent linkage betwe«i the two entities. This covalent Itakage is 
shown in to tUrd column of to table. For appUcations involving syntohc 
oligonucleotides, primary amines or thioU are often introduc«l during to syntosis at the 
termini of to molecules to permit labelUng. Any of to ftactionalities listed below could 
be introduced into to compounds of this invention to permit to mass maAers to be 
attached to a molecule of interest A reactive flmctionality can be used to introduce a 
flirtoi anker groups with a ftether reactive functionality if that is dBstoed. Table Us not 
intarded to be exhaustive and to present invention is not limited to to use of only to 
listed functionalities. 



20. 



4-Apr-02 



Table 1 



Nucleophilic Functionality 


Reactive Functionality 


Resultant Linkins Group 


-SH 


-S02-CH=CR2 


-S-CR2-CH2-SO2- 


j -NH2. 


-S02-CH=CR2 


-N(CR2-CH2-S02-)2or 
-NH-CR2-CH2-SO2- 


-NH2 ' 

-WJtl 


o 
o 


-CO-NH- 
-CO-O- 


-NH2 


o 

II "M 


-CO-NH- 


1 -OH 


o 


-co-o- j 


-NH2 


-NCO 


-NH-CO-NH- 


-NH2 


-NCS 


-NH-CS-NH- 


-NH2 


-CHO 


-CH2-NH- 


-NH2 


-SO2CI 


-SO2-NH- 


-NH2 


CH2=CH- 


-NH-CH2-CH2- 


-OH 1 


-OP(NCH(CH3)2)2 


-op(=oxo)o- 1 



It should be noted that in applications involving labelling oligonucleotides with the mass 
markers of this invention, some of the reactive functionalities above or their resultant 
linking groups might have to be protected prior to introduction into an oligonucleotide 
synthesiser. Preferably unprotected ester, thioether and thioesteis, amine and amide 
bonds are to be avoided, as these are not usually stable in an oligonucleotide synthesiser. 
A wide variety of protective groups is known in the art which can be used to protect 
linkages from unwanted side reactions. Altematively, amine functionalised 
oligonucleotides can be prepared using standard methods known in the art and these can 
be labelled with amine reactive reagents such as NHS-esters. 
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Analysis of labelled biomolecules 

m Hie second aspect of Hiis invention, a method of detecting biomolecules is provided. In 
this mefliod &e biomolecules are covalently labelled with a dye molecule. The labelled 
molecules are then embedded in a MALDI matrix of a further dye molecule, which may be 
the same or different ftom the dye used to label flie biomolecule. The labelled and 
embedded biomolecules are then analysed in a MALDI mass specliometer. The dye used 
to label the analyte biomolecule and the dye chosen as a free matrix are both chosen to 
absorb light strongly in the frequency used for the MALDI process. Typically, laser 
Ultra-Violet (UV) frequencies of 266 nm (Nd-YAG lasers) or 337 nm (Nitrogen Lasers) 
are used. 

Peptides and proteins are preferred biomolecules that benefit from the methods of this 
invention. A polypeptide or peptide or mixtures of polypeptides or peptides can be 
isolated for analysis by any of the conventional means such as electropboresis, 
chromatography or affinity chromatography. For the purposes of mass spectrometry, it is 
preferred that polypeptides or proteins are not contaminated with salts or detergents, 
particularly metal salts. Various techniques for desalting a polypeptide or peptide 
nrixture are known in the art such as gel filtration, dialysis or the use of hydrophobic 
resins. A particularly convenient and simple method for de-salting peptides involves 
aspiration of a small quantity of a solution of the peptide or polypeptide mixture in a 
pipette tip incorporating C18 packing materials. Salts and detergents will be removed 
first, substantially inq)rovmg tiie detection sensitivity of the analysis of the peptides. 
Pipette tips pre-packaged with appropriate resins and instmctions for their use are 
commercially available from Millipore (Bedford, MA, USA) under the trademark 'Zip 
Tip'. Desalting procedures may take place after labelling of the analyt» to remove 
unreacted tags, if preferred. 

Proteins contain various nucleophilic functionalities fliat can be labelled usmg reagents 
that are reactive to these fimctionalities. Proteins typically contain thiol, amino, hydroxyl 
and imidazole groups. These may all be labelled with appropriate reagents if desired. lii 
preferred embodiments of this invention, amino groups are labelled. Amino groups may 
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be labelled with a variety of labels but acid chlorides and active esters are usually the 
most selective reactive functionalities. Preferred UV-absorbing dyes for use with this 
invention include active esters of drmamic acid and its derivatives, active esters of 
nicotinic add or active esters of hydroxybenzoic add derivatives. Thus, in one 
enabodiment of the second aspect of this inventions, isolated peptides in a mixture are 
labelled with an active &sUnr of 4-hydroxy-alpha-cyano^cinnainic acid. The p^tides are 
desalted using a Zip tip and then embedded in a matrix of unmodified 4-hydroxy-alpha- 
cyano-cinnamic acid. Typically, a solution of the matrix is prepared in a volatile solvent 
such as acetonitrile containing a small amount of trifluoroacetic add (0.1 to 0.5% by 
volume is sufficient). This solution is then pipetted onto a metal target to form small 
droplets. A small quantity of the desalted, labelled peptide solution is then dropped into 
the droplet of matrix solution. This solution is then left to dry so that the peptides can co- 
crystallise with the matrix. In other techniques the matrix solution is allowed to dry and 
crystallise before the peptide solution is added on top (Hutchens and Yip, Rapid 
Commun. Mass Spectrom 7: 576 - 580, 1993). This procedure may also be repeated to 
produce layers of co-crystallised analyte and matrix. These and other variations of the 
co-crystalhsation technique may be used to improve the analysis of peptides or 
polypeptides. Liquid matrices, as discussed above may also be used. The matrix/peptide 
co-crystals are then analysed by laser desoiption in a MALDI-TOF mass spedrometer. 

Mass Spectrometers 

The essential features of a mass spectromet^ are as follows 

Inlet System -> Ion Source -> Mass Analyser -> Ion Detector -> Data Capture System 

There are various inlet systems, ion sources and mass analysers that can be used for the 
purposes of analysmg large biomolecules but in the context of this invention the ion 
source is a Matrix Assisted Laser Desoiption ion source for which there are only a limited 
number of inlet systems. A variety of mass analysers, ion detectors and data capture 
systems may be used v/ith MALDI although some mass spectrometer geometries are not 
commercially produced. Time-of-flight mass analysers are typically used witli MALDI as 
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weU as Fourier Transform Ion Cyclofon Resonance mass analysers and 
Quadmpole^ime-of-flight mass analysers. In principle ion traps and sector instruments 
can be used vdth MALDI but generally these are not commercially produced. 

Inlet Systems 

Moving belt systems are one of the few inlet systems that have been used wi& laser 
desorption for the analysis of labelled nucleic acids (Limdao Xu et al., Anal. Chem. 69: 
3595 - 3602. "Electrophore Mass Tag Dideoxy DNA sequencing", 1997). Although Ihis 
technology luis not been used with MALDI it should be trivial to adapt such interfaces for 
use with the reagents and methods of this invention. 

Matrix Assisted Laser Desorption lonisation (MALDI) 

MALDI requires that the biomolecule solution be embedded in a large molar excess of a 
photo-excitable 'matrix'. The application of laser light of the appropriate frequency 
results in the excitation of the matrix which in turn leads to rapid evaporation of the, 
matrix along wife its entrapped biomolecule. Proton transfer from the acidic matrix to the : 
biomolecule gives rise to protonated forms of the biomolecule which can be detected by 
positive ion mass spectrometry, particularly by Time-Of-FUght (TOF).mass spectrometry. 
Negative ion mass spectrometry is also possible by MALDI TOF. This technique in5,arts ; 
a significant quantity of translational energy to ions, but tends not to induce excessive 
fragmentation despite this. The laser energy and the timing of the application of the 
potential difference used to accelerate the ions from the source can be used to control 
fr^entation with this technique. This technique is highly , favoured for the 
detennination of peptide mass fingerprints due to its large mass range, due to the 
prevalence of singly charged ions in its spectra and due to the abiUly to analyse multiple 
peptides simultaneously. 

The photo-excitable matrix comprises a 'dye' , i.e. a compound that strongly absorbs light 
of a particular frequency, and which preferably does not radiate that energy by 
fluorescence or phosphorescence but rather dissipates the energy thermally, i.e. through 
vibrational modes. It is the vibration of the matrix caused by laser excitation that results 
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in rapid sublimatioa of the dye which simultaneously takes the embedded analyte into the 
gas phase. 

Mass Analysers 

- Ttme-of-Flighf Mms Analysers 
As the name implies, Time-of-flight mass analysers measure the time it takes for ions to 
travel a predetermined distance under the influence of a predetermined potential 
difference. The time-of-flight measurement allows the mass-to-charge ratio of ions 
striking a detector to be calculated. These instruments measure the arrival of almost all of 
the ions in a sample and as a result can be quite sensitive although, selectivity with tins 
te<*nique is more difficult to achieve. This technique can also detect ions with higher 
mass-to-charge ratios than can typically be measured in an ion trap or quadrupole mass 
spectrometer. TOF mass analysers are presentiy widely used with MALDI. 

Ion Traps 

Ion Trap mass analysers are related to the quadrupole mass analysers. The ion trap 
generally has a 3-electrode construction - a cylindrical electrode with 'cap' electrodes at 
each end forming a cavity. A sinusoidal radio frequency potential is applied to the 
cylindrical electrode while the cap electrodes are biased with DC or AC potentials. Ions 
injected into the cavity are constrained to a stable drcular trajectory by the oscillating 
electric field of the cylindrical electirode. However, for a given anqslitude of the 
oscillating potential, certain ions will have an unstable trajectory and will be qected from 
the trap. A sample of ions injected into the trap can be sequentially ejected from the trap 
according to their mass/charge ratio by altering the oscillating radio frequency potential. 
The ejected ions can then be detected allowing a mass spectrum to be produced. 

Ion traps are generally operated with a small quantity of a 'bath gas', such as helium, 
present in the ion trap cavity. This increases both the resolution and the sensitivity of the 
device as the ions entering the trap are essentiaUy cooled to the ambient tenq)erature of 
the bath gas through collision wi th the bath gas. Collisions both increjasc ionisation when 
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a sairple is introduced into ^e trap and dampen the anqjlitude and velocity of ion 
trajectories keeping them nearer flie centre of liie trap. This means that when the 
oscillating potential is changed, ions whose trajectories become nnstable gam energy 
more rapidly, relative to the damped circulating ions and exit flie trap in a tighter bimdi 
giving a narrower larger peaks. 

Ion traps can mimic tandem mass spectrometer geometries, in feet they can mimic 
xnultiple mass spectrometer geometries allowing complex analyses of trapped ions. A 
single mass species from a sanq)le can be retained in a trap, i.e. all other species can be 
ejected and then the retained species can be carefully excited by super-imposing a second 
oscillating frequency on the first The excited ions will then collide with the bath gas and 
wai fragment if sufficiently excited. The fragments can then be analysed further. It is 
possible to retain a fragment ion for forther analysis by egecting other ions and tiien 
exciting the fragment ion to fragment This process can be repeated for as long as 
sufficient sanqjle exists to permit further analysis. It should be noted that iiiese, 
instruments generally retain a hi^ proportion of firagment ions after induced 
fragmentation. Hiese instruments and FTICR mass spectrometers (discussed below) ; 
represent a form of ten^orally resolved tandem mass spectrometry rather tiian spatially, 
resolved tandem mass spectrometry which is found in Ihiear mass spectrometers. 

Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FTICR MS) 
FTICR mass spedromeiry has similar features to ion traps in that a sample of ions is 
retained within a cavity but in FHCRMS the ions are trapped in a high vacuum chamber 
by crossed electric and magnetic fields. A pair of plate electrodes that form two sides of a 
box genemtes tiie electric field. The box is contained in the field of a superconducting 
magnet which in conjunction with the two plates, the tapping plates, constrain injected 
ions to a circular trajectory between the trapping plates, perpendicular to the applied 
magnetic field. The ions are excited to larger orbits by applying a radio-frequency pulse 
to two 'transmitter plates', which form two further opposing sides of the box. The 
cycloidal motion of the ions generates conresponding electiic fields in the remaining two 
opposing sides of tbe box, which comprise the 'receiver plates'. The excitation pulses 
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excite ions to larger orbits which decay as the coherent motions of the ions is lost through 
collisions. The corresponding signals detected by the receiver plates are converted to a 
mass spectrum by Fourier Transform (FT) analysis. 

For induced fragmentation experiments these instruments can perform in a similar manner 
to an ion trap - all ions except a single q>ecies oTmterest can fee" ejected from the trap. A 
collision gas can be introduced into the trap and fragmentation can be induced. The 
fragmoQt ions can be subsequently analysed. 

Analysis of peptides by mass spectrometry 

Fragmentation of peptides by collision induced dissociatian, to determine their sequence, 
may be used in this invention to identify proteins, not identified by the pattern of masses 
of their digestion products. In addition, fragmentation can also be obtained by photo- 
excitation or by electron impact Various mass analyser geometries may be used to 
fragment peptides and to determine the mass of the fi*agmen1s. 

MS/MS andMl^ analysis of peptides 

Tandem mass spectrometers allow ions with a pre-determined mass-to-charge ratio to be 
selected and fragmented (by collision induced dissociation (CID), for example). The 
fragments can then be detected providing stinictural information about the selected ion. 
When peptides are analysed by OD in a tandem mass spectrometer, characteristic 
cleavage patterns are observed, which allow the sequence of the peptide to be determined. 
Natural peptides typically fragment randomly at flie amide bonds of the peptide backbone 
to give series of ions that are characteristic of the peptide. OD fragment series are 
denoted an, bn, Cn, etc. for cleavage at the n* peptide bond where the charge of the ion is 
retained on tiie N-terminal fragment of the ion. Similarly, fragment series are denoted Xn, 
yn» Zn, etc. where the charge is retained on tiie C-terminal fragment of flie ion. 
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Trypsin Lys-C and thrombin are favoured cleavage agents for tandem mass spectrometry 
as they pro<i«ce peptides with basic groups at both ends of the molecule, i.e. the alpha- 
amino group at flie N-terminus and lysine or arginine side-chains at the C-terminus. This 
favours the formation of doubly charged ions, in which the charged centres are at 
opposite termini of the molecule. These doubly charged ions produce both C-termmal and 
N-terminal ion series after CID. This assists in determining flie sequence of the peptide. 
Generally speaking only one or two of the possible ion series are observed in tiie CDD, 
spectra of a given peptide. In low-energy colUsions typical of quadrupole based: 
instruments the b-series of N-terminal fragments or the y-series of Cterminal fragments 
predominate. If doubly ch^ged ions are analysed then both series are often detected. In 
general, the y-series ions predominate over the b-series. 

hi general peptides fragment via a mechanism that involves protonation. of the amide 
backbone follow by intramolecular nucleophihc attack leading to the formation of a 
5-membered oxazolone structure and cleavage of the amide linkage that was protonated 
(Schlosser A. and Lehmami W.D. J. Mass Spectrom. 35: 1382-1390. 'Tiv^-membered 
ring formation in unimolecular reactions of peptides: a key structural element controUmg 
low-energy coUision induced dissociation", 2000). Figure 16a shows one proposed 
mechanism by which this sort of fragmentation takes place. This mechanism requires a 
carbonyl group from an anride bond adjacent to a protonated amide on the N-terminal 
side of the protonated amide to carry out the hucleophiHc attack. A charged oxazolonium 
ion gives rise to b-series ions, while proton transfer from the N-terminal fragment to the 
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C-terminal fragment gives rise to y-series ions as shown in Figure 16a. This requirement 
for an appropriately located carbonyl group does not account for cleavage at amide bonds 
adjacent to the N-terminal amino acid, when the N-terminus is not protected and, in 
general, b-series ions are not seen for the amide between the N-tenninal and second 
amino acid in a peptide. However, peptides with acetylated N-termini do meet the 
structural requirements of this fliechaiiism and ftagmwitetion can tike place at the amide 
bond immediately after the first amino add by this mechanism. Peptides labelled on 
lysine or at an alpha-amino group with active ester tags of this invention should not 
undergo significant cleavage at the tag peptide during low energy CID analysis if there is 
no appropriately located carbonyl group in the tag molecule. This means that CID spectra 
for labeUed peptides should be similar to the speclia for unlabelled peptides although the 
spectra will show mass shifts corresponding to multiples of the mass of the tag. This can 
be readily conqiensated for in the analysis of such CID spectra. 

A typical tandem mass spectrometer geometry is a triple quadrupole which comprises two 
quadnipole mass analysers separated by a collision chamber, also a quadrupole. This 
collision quadrupole acts as an ion guide between the two mass analyser quadrupoles. A 
gas can be introduced into the collision quadrupole to allow collision with the ion stream 
from the first mass analyser. The first mass analyser selects ions on the basis of their 
mass/charge ration which pass through the colHsion ceU where they fragment. The 
fragment ions are separated and detected in the third quadrupole.. Induced cleavage can be 
performed in geometries other than tandem analysers. Ion trap mass specfrometers can 
promote fragmentation through excitation of a defined m/z range resulting in coUision 
with the bafh gas. Ion traps generaUy contain a bath gas, such as helium. Similarly photon 
induced fragmentation could be applied to tr^ed ions. Another fevourable geometry is a 
Quadrupole/Orihogonal Time of Flight tandem instrument where the high selectivity of 
the quadrupole is coupled to the high sensitivity of a reflectron TOF mass analyser to 
identify the products of fragmentation. 

Conventional 'sector- instruments are another common geometry used in tandem mass 
spectrometry. A sector mass analyser comprises two sepai aie 'sectors', an electric sector 
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which fociises an ion beam leaving a source into a stream of ions with the same kinetic 
energy using electric fields. The magnetic sector separates the ions on the basis of their 
mass to generate a spectrum at a detector. For tandem mass spectrometry a two sector 
mass analyser of this kind can be used where the electric sector provide the first mass 
analyser stage, the magnetic sector provides the second mass analyser, with a collision 
cell placed between the two sectors. Two complete sector mass anal^^ers separated by a 
collision cell can also be used for analysis of mass tagged peptides. 

The present invention will be described in further detail by way of exanq)le only with 
reference to the following specific embodiments. 

Examples 

Example 1 - Synthesis of amine-reactive derivatives of 4-hydroxy<Apha-<yaiw-cmnamic 
acid 

Two different reactive derivatives of 4-hydroxy-alpha-cyano-cinnamic acid (HCCA) were 
synthesised. The first derivative prepared was the N-hydroxysuccinimide (NHS) ester of 
the carboxyl function of HCCA. The NHS ester of HCCA (NHS-HCCA) is an amine 
reactive derivative (see Figure 1). The first synthesis was successful and gave good 
results on the MALDI but the yield of the reaction with peptides was not as good as 
desired so a second synthesis was devised to provide a derivative of HCCA which 
incorporated a spacer between the cinnamic acid functionalily and the reactive NHS ester 
group. This second reagent (see Figure 1) will be referred to as NHS-L-HCCA is also 
amine reactive. 

Example 2 - Synthesis of 2-cyano-3-(4-tert-butoxycarbonyloxyphenyl) acrylic acid 
N-hydroxysuccinimide ester [NHS-HCCA] 

The synthesis of NHS-HCCA is shown in Figure 1 . In the furst step the hydroxyl function 
of HCCA is protected followed by activation of the carboxylic acid moiety. 
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Example 2a - Synthesis of2'^cyano-3-(4'tert-butoxycarbonyloj(yphenyl) acrylic acid 
5.6g (SOmmol) of 2-cyano-3-(4-hydroxyphenyl) acrylic acid dissolved in 40 ml 
acetonitrile were stirred with 4.6 ml (33 mmol) of triethylamine until a clear solution was 
obtained. 0.2 g (4 mmol) 4-dimethylaminopyridine and 7.3 g (33 mmol) 
^^ft-butylpyrocarbonate were then added to the clear solution. After 2 hoiirs reaction (the 
solufiori is at that "point colourless), flie solution was reduced to its half volume and 
washed with 200 ml of a cold 5% citric acid solution. 

The organic layer was extracted by ethyl acetate. After evaporation of the solvent, the 
product was precipitated from heptane/ethyl acetate. 

Yield: 7.4 = 85% 
Melting Point: 120-121^0 

Example 2b - Synthesis of 2-cyanO'3''(4-tert'butoxycarbonyloxyphenyl) acrylic acid 
N-hydroxysuccinimide ester 

0,95 g (5 mmol) 2-cyan-3-(4-tert-butoxycarbonyloxyphenyl) acrylic acid and 0.6 g 
(5 mmol) N-hydroxysuccinimide were dissolved in 10 ml acetonitrile. 1.1 g (5.7 mmol) 
N-ethyl-N'-3-dimethylaminodicyclohexylcarbodiiniide hydrochloride were added in 
portions to the reaction mixture. After 2 hoiirs, the solvent was evaporated. The residue 
obtained was washed with ethyl acetate and with a cold 5 % citric acid solution and then 
with a saturated NaCl solution. The product was purified by chromatography. 

Yield: 1.2g=63% 

Example 3 - Synthesis of 2'Cyano-3-(4-hydroxyphenyl) acrylic acid (6-[2,S'- 
dioxo-pyrrolidinyl-N-oxy] "6-0X0) hexyl amide [NHS-L-HCCA] 

The synthesis of NHS-L-HCCA is shown in Figure 1. In the first step a chlorinated linker 
is produced. The chlorine group is then nucleophiUcally substituted by a cyanide ion. 
This cyano linker is then condensed with 4-hydroxyb©Qzaldehyde to give a cinnamic acid 
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derivative with a six carbon chain linker with a free carboxyl group that is, activated to 
form an NHS-ester in the final step of the synthesis. 

Example 3a - Synthesis of 6-(chloracetamido) hexanoic acid 

18 ml (221 mMol) chloracetylchloride was added dropwise to 20 g (153 mmol) 
6-aniinohexanoic acid dissolved in 80 ml cold NaOH solution (2N) at room tenq)eratiire 
(RT). The reaction mixture was stirred for 30 minutes while the pH of the solution was 
kept between 10-11 Avith occasional addition of NaOH solution (6N)* The pH of the 
reaction mixture was then altered to pH 5 with HCl (2N) and the residue was filtered. 
The residue was then washed with water until the pH of the water was neutral* The 
product, dried over phosphoras pentoxide, was re-dissolved in 300 ml chloroform and 
filtered to remove the undissolved residue* Heptane was added to the filtrate and a synq> 
was obtained by stirring under cooling. The product was filtered, dried and was then 
orj^talUzed firom water. 



Yield: 20 g=63 % 
Melting Point: 82°C 

Example 3b - Synthesis of 6-(cyanoacetamido) hexanoic acid . ; 

2.8 g (20 mmol) potassium hydrogen carbonate were added to 8.3 g (40 mmol) 
6-(chloracetamido) hexanoic acid dissolved in 25 ml water. 3.2 g (48 mmol) potassium 
cyanide was then added to the clear solution which was cooled on ice. The reaction 
mlxtiire was stirred for 17 hours and was then acidified with HQ (2>J)- The residue after 
extraction was purified by chromatography (silica gel, solvent: ethyl acetate). 



Yield: 6g-76 % 
Melting Point: 80-8PC 
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Example 3c - Synthesis of2-cyano-3-(4'hydroxyphenyl) acrylic acid (6-hydroxy-6'Oxo- 
k&Q>l)anUde 

7.5 g (38 imnol 6-(cyanacetaimdo) hexanoic acid was dissolved in 50 ml pyridine. 4.6 g 
(SSmmol) 4-hydroxybenzaldehyde and 0.7 ml (7mmol) piperidine were added to the 
soluti(Mi and the reaction mixture was stirred for 2 hoixrs at 50''C. A ftirther aliquot of 
D.4 ml piperidine was then added to tKe reaction aad the solutian was istiired at RTfcMf 60 
hours. The residue after evaporation of the pyridine was acidified with HQ (2N) and was 
extracted with ethyl acetate. The organic layer was poiired into 300 ml of a saturated 
solution of sodium hydrogen carbonate. The aqueous layer was then carefully acidified 
with HCl (2N). The precipitated residue was filtered and washed with water. The dried 
product was crystallized from dioxane. 

Yield: 8.4g=74% 
Melting Point: 178°C 

Example 3d - Synthesis of 2-cyano-3-(4-hydroxyphenyl) acrylic acid (6-[2,5- 
dioxo-pyrroUdinyl-N-oxy]-6-oxo) hexyl amide 

2.3g (20mmol) N-hydroxysuccimide and 6.0g (20 mmol) 2-Cyan-3-(4-hydroxyphenyl) 
acrylic acid [6-hydroxy -6-oxo-hexyl] amide were dissolved in 200°ml dioxane at 
60-70°C. 4.6g (22 mmol) of dicyclohexylcarbodiimide was added to the reaction mixture 
which was stirred and cooled to 30°C. This temperature was maintained for one hour. 
The reaction mixture was then brought to room tenq)erature and stirred for 20 hours. The 
solution was filtered and evaporated. The residue was treated with dicbloromethane and 
a solution of sodium bicarbonate. The organic layer after extraction was washed with 
water and dried over sodium sulphate. After evaporation of the solvent, the residue was 
then chromatographed on silica gel. The product was crystallised from ethyl acetate. To 
avoid the exothermic reaction caused by depositing the product onto silica gel, the 
column was first cooled. 



Yield: 50g=63% 
Melting Point: 142*'C 
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Example 4 - Reaction of NHS-HCCA with Resin-bound Peptide 

The first active ester cinnamic acid mass tag (NHS-HCCA), whose synthesis is described 
above, was attached to resin bound peptides according to the following procedure. 

Target Pept^e 

The peptide sequence, HRDPYRFDPHKD, was synthesised on a Wang type resin using 
standard FmocABu synthesis procedures (see for exanq)le Fields G.B. & Noble R.L., Int J 
Pept Protein Res 35(3): 161-214, "Solid phase peptide synthesis utilizing 
9-fluorenyhnethoxycarbonyl amino acids." 1990). The resin-bound peptide yields a free 
add moiety after cleavage. 
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KHS-HCCA 




Molecular Weight = 
Exact Mass = 
Molecular Formula = 

Molecular Cbinposition 



286.24 
286.06 

C14H10N2O5 
= C 58.7 %, H 3.5 %, N 9.8 %, O 28.0 % 



Protocol for the labelling 

The resin bound peptide was deprotected at the N-teraiinus following standard methods 
(piperidine in DMF) and reacted with an excess of the above reagent NHS-HCCA and 
wi* excess of a base (1 17 mg = 0.4 mol in 800 ^1 DMF with 1 eq. = 44.3 mg DIPEA). 
The mixture was incubated at room tenq)erature for 3h. 



Cleavage and work-up 

After the labelling, the resin was washed and the peptide was cleaved off of the resin by 
mcubation for 2 hours with the follov/ing scavenger mix: 
TFA:*ioamsole:tdethylsilane:^vater; 92.5 %:2,5 %:2.5 %:2.5 % respectively. The 
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peptide was precipitated and washed with diethyl ether and lyqphilised from tert- 
butanol/water (4:1) to yield a white powder. 

Structure of the desired compottnd 
NH-HRDPYRFDPHKD-COOH 



Molecular weight: 1753.87 g/mol (average) / 1752.78 g/mol (monoisotopic) 
Mono-isotopic composition: C80H104N24O22 

Example 5 - Analysis ofNHS-HCCA labelled peptide and comparison with unlabelled 
peptide 

The product of the labelling reaction was analysed by HPLC (see Figure 2) and was 
found to comprise a mixture of labelled and unlabelled peptide. The labelled p^tide 
comprises about 85 % of the mixture while the unlabelled peptide conq)rises about 15 % 
of the mixture. Analysis of the mixture by electrospray icwiisation mass spectrometry 
does not show any improvement in the sensitivity of detection of the labelled peptide 
however analysis of the mixture by MALDI TOF (Figure 3) shows that the labelled 
peptide, which is present as a smaller percentage of the mixture is detected with far 
greater sensitivity than the unlabelled peptide. 



NH-HRDPYRFDPHKD-OH 




OH 



36 



4-Apr-02 



Example 6 - Reaction ofNHS-L-HCCA with Resin-boimd P^tides 

The second active ester cinnainic acid mass tag (NHS-L-HCCA) coiiq>rising an additional 
six carbon chain linker, whose synthesis is described above, was attached to resin bound 
peptides according to the following procedures. 

Peptides 

The following peptide sequences were synthesised on a Wang type resin using standard 
Fmoc/tBu synthesis procedures: 

HSRGARGHSHAHRG (highly basic) M: 1522.62 

YDSGEGDESDAYEG (highly acidic) M: 1493.38 

LGWSGFGWSFSYLG (highly hydrophobic) ^ M: 1563.75 
(bound to a trityl type resin to yield a firee acid after cleavage) 




Molecular Weights 399.40 
Exact Mass = 399.14 
Molecular Formula- C20H21N3O6 

Molecular Composition = C 60. 14 %, H 5.30 %, N 10.52 %, O 24.04 % 
Protocol for peptide labelling 

The resin bound peptide was deprotected at the N-terminus following standard methods 
(piperidine in DMF) and reacted with an excess of the reagent NHS-L-HCCA and with an 
excess of a base (128mg = 0.4M in 800^1 DMF witih leq. = 44.3mg DBPEA). The 
mixture was incubated at room temperature for 3 hours. 

Cleavage and work-up 

After the labelling reaction, the resin was washed and the peptides were cleaved from the 
resin by incubation for 2 hours with the following scavenger mix: TFA 92,5 % / 
thioanisole; 2.5 % / triethylsilane; 2.5 % / water; 2,5 %. The peptides were precipitated 
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and washed with diethyl ether. FinaUy, the peptides were lyophilised from tert- 
butanol/water (4:1) to yield a white powder. 




Molecular weight: 1806.94 g/mol (average) / 1805.87 g/mol (mono-isotopic) 
Mono-isotopic composition: CygHi 1 1N33O20 




39 4-Apr-02 

NH-YDSGEGDESDAYEG-COOH 




OH 



Molecular weight: 1777.70 g6xiol (average) / 1776.63 g/mol (mono-isotopic) 
Mono-isotopic conqposition: C76H96N16O34. 
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NH-LGWSGFGWSFSYLG-COOH 




Molecular weight: 1848.06 g/mol (average) / 1846.84 g/mol (mono-isotopic) 
Mono-isotopic composition: C94Hii4Ni8022- 

Example 6 - Analysis ofNHS-L-HCCA labelled peptide and comparison with unlabetted 
peptide 

The peptides were analysed by HPLC. The first peptide, HSRGARGHSHAHRG, was 
isolated pure as the reaction with the tag containing the linker went to conqjletion. The 
labelled peptide was combined with unlabelled peptide to give a 50/50 mixture at a 
concentration of Ijimol/fil of each peptide. Analysis of the mixture by electrospray 
ionisation mass spectrometry does not show any inaprovement in the sensitivity of 
detection of the labelled peptide compared to the unlabelled peptide. However, O.Sjxl of 
the mixture on a metal target analysed by MALDI TOF MS shows that the labelled 
peptide is detected with greater sensitivity than the imlabelled peptide. This indicates that 
the linker functionality does not reduce the ejffectiveness of the tag and can be used to 
improve the yield of the labelling reaction. 
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Example 7 - Reaction ofNHS-L-HCCA with Beta-MSH Peptides 



Matrix MT 




Molecular Formula of NHS-L-HCCA: C20H21N3O6 
Molecular Weight: 399.41 

Peptide labelling in solution 

Protein: P-MSH pEGPYKMEHFRWGSPPKD, M: 2175) 

1 mg (0.46 nmol) peptide is dissolved in water/acetonitrile (250 jxl, 90/10). 30 jag trypsin 
in borate buffer (100 ^1, pH 8,0) are added to the peptide solution. The digestion is 
carried out for 1 8 h at 37°C. 

2.5 mg (6.3 mmol) NHS-L-HCCA in 250 \>L acetonitrile are added to 175 \i\ (1/2 of the 
digest solution) of the ttyptic peptide soluticm. The labelling reaction is carried out for 5 h 
at RT and at pH 7.,5. The reaction is then quenched by adding 100 pil H20/acetonitrile 
(95/5) and 1 % TFA. The labelled peptide solution is then experimented in MALDI MS. 
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Example 8 - Comparison of spectra from NHS-L-HCCA labelled beta-MSH peptides with 
unlabeled beta-MSH peptides 

Peptide mass fingerprints from beta-MSH 

Tryptic digest of unlabeled beta-MSH gives rise to 3 fragments as shown in Table 2 
below. 



Table 2 



m/z 


Fragment 
Positions 


Cleavage 


Composition 




Isotopic 

Mass 

(Baltons) 


Average 

Mass 

(Daltons) 


Start 


Finish 


No. of 

reactive 

groups 


Sequence 


708.3204 


708.7507 


1 


6 . 


2 


DEGPYK 


719.3299 


719.8458 


7 


11 


2 


MEHFR 


786.3786 


786.8682 


12 


18 


2 


WGSPPKD 


Tryptic digest of labelled beta-MSH gives rise to 3 fragments as shown in Table 3 below. 


Table 3 
















na/z . 


Fragment 
Positions 




Cleavage 


Composition 




Isotopic 

Mass 

O)altons) 


Start 


Finish 


No. of 
tagged 
reactive 
groups 


Sequence 




992.435 


1 


6 


1 


DEGPYK + 


1 




1276.535 


1 


6 


2 


DEGPYK + 
tags 


2 




1003.445 


7 


11 


1 


MEHFR+ 1 tag 




1287.560 


7 


11 


2 


MEHFR + 

tags 


2 




1354.609 


12 


18 


2 


WGSPPKD + 2 
tags 





Various experiments were performed on the above labelled and unlabelled digests. A 
schematic of the labelling protocol for beta-MSH is shown in Figure 5a. Figure 5b shows 
the IvL^LDI TOP mass sasctmrn of dis .unlabeled and imdiii-5.T.teti peptide. 
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Figure 6 shows spectra MALDI TOF for the unlabelled and labelled digests of beta-MSH. 
It can be seen that the 3 expected peptides for the xuilabeled digest are present in the 
spectrum of the unlabeled peptides. The labelled peptide spectrum shows 4 diffCTent 
species. The peptide con^prising amino acids 1 to 6 has 2 sites available for labelling; the 
alpha-amino group and the lysine epsilon amino group. Both of these appear to be fully 
labelled and only one species corresponding to the doubly labelled peptide appears in the 
spectrum. Similarly, the peptide con5>rising amino acids 12 to 18 has 2 sites available for 
labelling; the alpha-amino group and the lysine epsilon amino group and again both 
appear to be fully labelled and only one species corresponding to the doubly labelled 
peptide appears in the spectrum. The peptide comprising amino acids 7 to 1 1 should only 
label at the alpha-amino group but appears to have been labelled twice in the mass 
q)ectrum as 2 species corresponding to this peptide are detected in the spectrum of the 
labelled fragments. There is a minor side-reaction where the NHS-L-HCCA reacts with 
histidine giving rise to a second peak in the spectrum for this peptide. 

The singly labelled peptide comprising amino acids 7 to 11 of beta-MSH appeared most 
strongly in the spectrum of the labelled protein. This peptide fragment was isolated by 
HPLC chromatography and solutions of this peptide were prepared. Figure 7 shows 
MALDI TOF mass spectra of this peptide tnixed in various molar ratios witii the digest of 
unlabelled beta-MSH. It can clearly be observed that the detected abundance of the 
labelled peptide is greater than the unlabelled peptides even at the lowest ratio of labelled 
to unlabelled peptides (5:95). 

Figure 8 shows the results of a further experiment to demonstrate the enhancement in 
sensitivity provided by the tags of this mvention. Figure 8 shows a number of MALDI 
TOF mass spectra for a serial dilution of a stock solution of 44 pmol of the labelled 
peptide comprising anaino acids 7 to 11 of beta-MSH mixed with 180 pmol of digested 
and unlabeled beta-MSH. The first spectrum shows the MALDI TOF analysis of the 
stock solution. Both the labelled peptide and the unlabeled peptides can be seen in 
various dilutions of the stock solution (spectra not shown) until a 625-fold dilution is 
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reached, which is the second spectrum shown in Figure 8. Labelled and unlabelled 
peptides can still be seen at this dUution but noise can be seen in the low mass ranges of 
flie spectrum. A further 5-fold dilution to a dilution of 3125-fold gives a spedrum in 
which the unlabeled peptides can no longer be detected but the labelled peptide is still 
detectable. The labelled peptide is still detectable in a further 5-fold dilution of the 
solution (I5625-f61d). The above iresults all indicate that there is a significant 
enhancement in the sensitivity of detection of peptides when covalently attached to a tag 
con^rising a MALDI matrix. 

Example 9 - Peptide Mass Fingerprinting experiments with proteins 
A further set of experiments with whole proteins were carried out. The proteins bovine 
serum albumin (BSA) and glyceraldehyde 3-phosphate dehydrogenase (GAPDH) were 
analysed individually. 

Labelling ofTryptic Digests of BSA and GAPDH: 

BSA (Probe A): 69,2kDa/60*Lys/26* Arg 

GAPDH (Probe B): 35,7 kDa / 26 * Lys / 10 * Arg 

A stock solution of 2 ng/^il protein in borate buffer (50 mmol, pH 8) and wea/thiourea 
(100 mmol, 2/1) is freshly prepared. 20 ^1 (1 ng/jil) trypsin are added to 260 ^1 of protein 
stock solution. The enzymatic cleavage reaction is carried out for 16 h at 37°C, pH 8.0 

160 nl of the tryptic peptide solution are labelled with 2.5 mg NHS-L-HCCA dissolved in 
250 ^l acetonitrile. After 5 hours labelling at 25**C, pH 7.8, Ihe reaction is then quenched 
by addmg 100 jil H20/acetonitrile (95/5) + 1 % TFA. 

Example 10 - Comparison of spectra from NHS-HCCA labelled BSA peptides with 
unlabeled BSA peptides: 

Figure 9 shows a typical MALDI TOF peptide mass fingerprint mass spectrum for BSA. 
The mass determined in this spectrum can be used to search a datebase of known and 
predicted mass spectra to identify the protein from iis component tiyptic peptide fragment 
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masses. Figure 10 shows the results of a search of the SWISS-PROT database, with some 
of the masses from the spectrum shown in Figure 10, using the MS-Fit program on the 
ProteinProspector database server. This tool is available on the world wide web and 
allows the identification of proteins with modified amino acids (Qauser K- R., Baker P. 
R- and Burlingame A. L., Analytical Chemistry, Vol. 71(14): 2871, **Role of accurate 
mass measuremCTt (+/- 10 ppm) in protein identification strategies employing MS or 
MS/MS and database searching." (1999)). The modified antiino acid masses are entered 
into the program along with a series of modified peptide masses. The output of the 
program is a list of proteins that contain peptides that match the masses submitted to the 
database. The matching peptides for each matching protein, identified by MS-Fit, can 
then be displayed to determine how well the peptides match the protein, which is shown 
in Figure 1 0. This analysis was pCTformed with unmodified lysine. 

Figure 1 1 shows the corresponding MALDI TOF peptide mass fingerprint mass spectrum 
for NHS-L-HCCA labelled BSA. Figure 12 shows the peptides that match BSA from a 
search of the SWISS-PROT database usmg the MS-Fit program, in which the' 
appropriately modified lysine mass has been entered, along with peptide masses from the 
spectrum in Figure 11. 



Figure 13 shows a typical MALDI TOF peptide mass fingerprint mass spectrum for 
GAPDH. The mass determined in this spectrum can be used to search a database of 
known and predicted mass spectra to identify the protein from its conciponent tryptic 
peptide fragment masses. Figure 14 shows the results of a search of the SWISS-PROT 
database, with some of the masses from the spectrum shown in Figure 13, using the MS- 
Fit prograan, where unmodified lysine is searched for. 

Figure 15 shows the corresponding MALDI TOF peptide mass fingerprint mass ^ectrum 
for NHS-L-HCCA labelled BSA. Figure 16 shows the peptides that match GAPDH from 
a search of the SWISS-PROT database using the MS-Fit program, in which the 
appropriately modified lysine mass has been entered, along wifli peptide masses from the 
spectrum in Figure 1 5. 
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CXAIMS: 

1. A me&od for characterising an analyte by matrix assisted laser desorption 
ionisation (MALDI) mass spectrometry, which method conqxrises: 

(a) labelling the analyte with a light-absorbing label that absorbs light at a 
pre-detennined frequency, to form a labelled analyte; 

(b) embedding the labelled analyte in a matrix formed from at least one 
compound that absorbs light, to form an embedded labelled analyte; 

(c) desorbing the embedded labelled analyte by exposing it to light having Ihe 
pre-determined frequency, to form a desorbed analyte; and 

(d) detecting the desorbed analyte by mass spectrometry to characterise the 
analyte. 

2. A method according to claim 1, wherein the desorbed analyte is directly detected 
by mass spectrometry. 

3. A method according to claim 1, in which the desorbed analyte is indirectly 
detected by mass spectrometry, wherein the analyte is additionally labelled with a mass 
label relatable to the analyte, and wherein the mass label is cleaved from the desorbed 
analj^e and detected by mass spectrometry to dbaracterise the analyte. 

4. A method according to any preceding claim, wherein the li^t to which the 
embedded labelled analj^ is exposed is laser light 

5. A method according to any preceding claim, wherein the compound forming the 
matrix absorbs light at the same frequency as the light-absorbing label. 

6. A method according to any preceding claim, wherein the matrix and the 
light-absorbing label are formed from the same coinpound. 
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7. A method according to any preceding claim, wherein the matrix is a solid matrix 
or a liquid matrix. 

8. A method according to claim 7, wherein the matrix is a liquid matrix conqirising 
nitrobenzyl alcohol. 

9. A method according to any preceding clahn, wherein the matrix comprises an acid 
matrix or a basic matrix. 

10. A method according to claim 9, wherein the matrix comprises a compound 
selected fiom 3-hydroxypicoHmc acid, 2,5-dihydroxybenzoic acid and 
4-hydroxy-alpha-<^inocinnamic acid. 

11. A mefliod according to any preceding claim, wherein the li^t-absorbing label is 
formed from a dye. 

12- A method according to claim 1 1 , wherein the dye is a non-fluoresceat dye. 

13. A method according to claim 11 or claim 12, wherein the dye comprises 
4-dimetiiylaminoazobenzine-4'-sulphonyl chloride (DABSYL chloride), 
3-hydroxypicolinic acid, 2,5-dihydroxybenzoic acid and 4-hydroxy-alpha-cyanocinnamic 

acid. 

14. A method according to any preceding claim, wherein the analyte conqnises one or 
more conq)omds selected from a protein, a polypeptide, a peptide, a peptide fragment and 
an amino acid. 

15. A metiiod for characterising a polypeptide, which method conq)rises the steps of: 

(a) optionally reducing cysteme disulpMde bridges in the polypeptide to form 
free thiols, and capping the free thiols; 
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cleaving the polypeptide with a sequence specific cleavage reagent to form 
peptide fragments; 

optionally deactivating the cleavage reagent; 

capping one or more 8-amino groups that are present with a lysine reactive 
agent; 

analysing peptide fragments according to a method as defined in any of 
claims 1-14 to form a mass fingerprint for the polypeptide; and 
determining the identity of the polypeptide firom the mass fingerprint. 

16. A method for characterising a population of polypeptides, which method 
comprises the steps of: 

(h) optionally reducing cysteine disulphide bridges in one or more, 
polypeptides to form firee thiols, and capping the firee thiols; 

(i) separating one or more polypeptides from the population; 

(]) cleaving one or more polypeptides with a sequence specific cleavage 

reagent to form peptide fragments; 
(k) optionally deactivating the cleavage reagent; 

0) * capping one or more e-amino groups that are present with a lysine reactive 
agent; 

(m) analysing peptide fragments according to a method as defined in any of 
claims 1-14 to form a mass fingerprint for one or more polypeptides; and 

(n) detOTnining the identity of one or more polypeptides from the mass 
fingerprint 

17. A method for comparing a plurality of samples, each sample comprising one or 
more polypeptides, which method comprises the steps of: 

(h) optionally reducing cysteine disxilphide bridges and capping the free thiols 
in one or more polypeptides from the samples; 

(i) separating one or more polypq^tides from each of the samples; 



(b) 

(c) 
(d) 

(e) 

(f) 
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(j) cleaving the polypeptides with a sequOTce specific cleavage reagent to form 

peptide fragments; 
(k) optionally deactivating the cleavage reagent; 

0) capping one or more e-amino groups that are present with a lysine reactive 
agent; 

(m) analysing peptide fragments according to a method as defmed in any of 
claims 1-14 to fomi a mass fingerprint for one or more polypeptides from 
the samples; and 

(n) determining tiie identity of one or more polypeptides in the samples from 
one or more mass fingerprints. 

18. A method according to any of claims 15-17, wherein the lysine-reactive agent is a 
labelled lysine-reactive agent 

19. A method according to claim 17, for comparing a plurality of samples, each 
sample comprising one or more polypeptides, which method comprises the steps of: 

(a) optionally reducing cysteine disulphide bridges and capping the free thiols 
in one or more polypeptides from the sanq>les; 

(b) capping one or more 8-an3ino groups that are present in each sample with a 
labelled lysine reactive agent; 

(c) pooling the samples; 

(d) separating one or more polypeptides from the pooled san^les; 

(e) cleaving the polypeptides with a sequence specific cleavage reagent to 
form peptide fragments; 

(f) optionally deactivating the cleavage reagent; 

(g) analysing peptide fragments according to a method as defined in any of 
claims 1-14 to form a mass fingerprint for one or more polypeptides from 
the samples; and 

(h) deternoining the identity of one or more polypeptides in the sanq)les from 
one or more mass fing^printe. 
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wherein the same label is enqsloyed for polypeptides or peptides from flie same sainple, 
and different labels are employed for polypeptides or peptides from different sair?>les, 
such that the sample from which a polypeptide or peptide originates can be determined 
from its label. 

20. A method according to any of claims 15-19, wherein the sequence specific 
cleavage agrat cleaves the one or more polypeptides on the C-terminal side of a lysine 
residue. 

21. A method according to any of claims 15-20, wharein the specific cleavage reagent 
conqnises Lys-C or Trypsin. 

22. A method according to any of claims 15-21, wherein the peptide fragments having 
capped E-amino groups are removed by afiBnity capture, and wherein the lysine reactive ; 
agent comprises biotin. 

23. A melhod according to any of claims 15-22, wherein tibie lysine reactive agent 
comprises a hindered Michael reagent " ' 

24. A method according to claim 23, wherein the hindered Michael agent comprises a 
compound having the following structure: 




wherein X is an electron withdrawing group that is enable of stabilising a negative 
charge; the R groups independently comprise a hydrogen, a halogen, an alkyl, an aryl, or 
an aromatic group with the proviso that at least one of the R groups conoprises a sterically 
hindering group; and the group r2 coinprises a hydrogen, a halogen, a hydrocarbon 
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group, an electron withdrawing group and/or a linker capable of attachment to an aflHnily 
capture functionality or a solid phase support. 

25. A labelled analyte compound, which compound has the following structure: 



wherein D comprises a ligjit absorbing label, M comprises a mass modifier, L comprises a 
linker and A comprises an analyte. 

26. A compoimd according to claim 25, wherein D conqdises a non-fluorescent dye. 

27. A compound according to claim 26, wherein D comprises a cinnamic acid 
derivative, a nicotinic acid derivative, a picolinic acid derivative, a hydroxybenzoic acid 
derivative, a methoxybenzoic acid derivative or a sinapinic acid derivative. 

28. A compound according to claim 26 or claim 27, wherein the non-fluorescent dye 
comprises a compound selected from 4-dimethylaminoazobenzine-4'-sulphonyl chloride 
(DABSYL ' chloride), 3-hydroxypicoUnic acid, 2,5-dihydroxybenzoic acid and 
4-hydroxy-alpha-cyanociimainic acid. 

29. A compoimd according to any of claims 25-28, wherein M is selected from a 
cotnpoimd formed from an aryl ether, and an oligomer formed from 2 or more aryl ether 
units. 



30. A compound according to any of claims 25-28, wherein the linker comprises a 
group selected from -CR2-CH2-SO2-, "N(CR2-CH2-S02-)2, -NH-CR2-CH2-SO2-, 
-CO-NH-, -CO-0-, -NH-CO-NH-, -NH-CS-NH-, -CH2-NH-, -S02-NH-, 
-NH-CH2-CH2- and -0P(==0)(0)0-. 



D-M-L-A 
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31. A compound according to any of claims 25-30, wherein A is selected from a 
protein, a polypeptide, a peptide, a peptide fragment and an amino acid. 

32. A conq)ound for labelling an analyte, which conq)ound has the following 
structure: 



wherein M is a nnass marker comprising a cinnamic acid derivative, a nicotinic acid 
derivative, a picolinic acid derivative, a hydroxybenzoic acid derivative, a 
methoxybenzoic acid derivative or a sinapinic acid derivative, L connprises a hnker and R 
comprises a reactive frmctionality for attaching the con^oimd to an analyte. 

33. A compound according to claim 32, wherein M comprises a non-fluorescent dye 
conipound selected from 4-<limethylaminoazobenzine-4*-suIphonyl chloride (DABSYL 
chloride), 3-hydroxypicolinic acid, 2,5-dihydroxybenzoic acid and 
4-hydroxy-alpha-cyanocinnamic acid. 

34. A compound according to claim 32 or claim 33, wherein M further comprises a*^ 
compound formed from an aryl ether, or an oligomer formed from 2 or more aryl ether 
units. 



35. A compound according to any of claims 32-34, wherein the linker conqprises a 
group selected from -CaR2-CH2-S02-, -N(CR2-CH2-S02-)2» -NH-CR2-CH2-SO2-, 
-CO-NH-, -C0-0-, -NH-CO-NH-, -NH-CS-NH-, -CH2-NH-, -S02-NH-, 
-NH-CH2-CH2-, -0P(=O)(O)0- and an amide linkage through the acid groups of M. 



M-L-R 



36. A conipound according to any of claims 32-35, wherein R is for attaching the 
compound to a protein, a polypeptide, a peptide, a peptide fragment or an amino acid. 
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37. A conqjoimd according to any of claims 32-36, wherein R conq>rises an ester 
groxq>, an acid anhydride group, an acid halide group such as an acid chloride,, an 
N-hydroxysucdnanride group, a pentafluorophenyl ester group, a maleindde groiq>, an 
aUcenyl sulphone group, or an iodoacetamide groiq>. 

38. An array of two or more compounds for labelling an jiialyte, whereiE the 
conq)ounds in the array are compounds as defined in any of claims 32-37 and wherein 
each compound in the array differs in mass from all other compounds in the array. 

39. An array of conq)ounds according to claim 38, wherein the difference in mass of 
each of the con^oimds in the array is achieved by isotopic substitution. 

40. A kit for characterising an analyte by matrix assisted laser desorption ionisation 
(MALDI) mass spectrometry, which kit con^ses: 

(a) one or more light absorbing labels having a reactive functionality for 
attaching the labels to an analyte; and 

(b) a compound for forming a matrix, which conapound absorbs Ught at the 
same frequency as the Ught-absorbuig labeL 

41 . A kit according to claim 40, wherein the component (a) comprises a compound, or 
an array of coiBpounds, as defined in any of claims 32-39. 
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ABSTRACT 

CHARACTEiaSING ANALYTES 

Provided is a method for characterising an analyte by matrix assisted laser desorption 
ionisation (MALDI) mass spectrometry, which method comprises: 

(a) labelling the analyte with a light-absorbing label that absorbs light at a 
pre-detemiined frequency, to form a labelled analyte; 

(b) embedding the labelled analyte in a matrix formed from at least one 
compound that absorbs light, to form an embedded labelled analyte; 

(c) desorbing the anbedded labelled analyte by exposing it to light having tiie 
pre-determined frequency, to form a desorbed analyte; and 

(d) detecting the desorbed analyte by mass spectrometry to charact^se the 
analyte. 
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Synthesis of 2-Cyan-3-(4-tert-butoxycarbonyloxyphenyI) acrylic acid-N- 
(hydroxysucclnimldesfer) [NHS-HCCA]: 




OH 



189,2 

JL 





Synthesis of 2-Cyan-3-(4-tert-hydroxyphenyI) acrylic acld-6-[(2,5-dloxo-1- 
pyrrdlldinyl) oxyH6-oxo) hexyl] amide [NHS-L-HCCA]: 



NH2> 



OH 



CN 



OH 



OH 



131 



207.7 

JL 



19B^ 

JL 





en 

d> CN 



oo 
o oo 
to x^ vo 
^ t** oo oo 

S oo OS vo 
vS- o i-« oo 



1 



• C3N VO 
O ON OO 
CS CS 

^ cnrocn 

aoo a\ vo 
, , O ^ oo 



CO 



CO 



a> 

p 



2h- 



SI 




CO 



O 

S 



i 



O 

o 
o 



o 




it 



ST VO VO '-I T-l r-H 

r-i t— r 



G 



^ cn cn vo o 

M OV O OO 

5 J^J^J 



Si 



31 



3 



« CM JT 



J2i 



^1 



T r 



CO 



o 
o 

CO 



o 

CD 



S 



II 



s 

s s 



<D 



CO 




■i 



s 




!5 



i 
1 



II 



t 



o 
o 



OQ 



S 

o 

I 
•g 



-I 

li 



-T — I — I — I — r — r 



5 



S 



Ausue^l 



i 



•s 



>1 



s 



1 



o 

CO 



-r 1 1 1 1 r 



CO 



1 

CO 

I 

f 

< 

pa 
o 

CO 

a 

o 
o 



s 

CO 



I 




^0 V5 ^ 
iH 



__i ^ 4^ «m{ t*~- 0\ 00 ON CO On QO 



t 
vo 



6^ 

oo 




I 

1 g 



d=\ VO 



ON CO ^ ON OS ^ ^ ^ g 06 in 2 I 
^ TT m NO vo op 



vo QO O 



^ ^ ^ 2 2 IS S2 
th th iH tH 



ON d ^ <^ ■'^ 

o S 55 t- ^ 

oo O O ON ON O 

^ S € VO VO OO 

^ fn ON en v> ^ 



OTl*00'^<^rHQ0 0NO 

§^vorjvgt-vojn^ 
ooenmc^t-tmcnvovo 

tS?20N00aNONOC?V0 

ONi-^ONjrl^cnooin^ie 



i 

8- 

I 



CO 



O 
O 

CV4 



5, 



00 



1 

+ 

PQ 
o 




JSI 

E 



.a 





&5 

cocQCQCOMCoogog co 



3 



g 



>5! 




ggesg 




m o^ 00 ^ 



tH 



so O O CO 



S in a 52 ^ o. 



T T V ^ ? ^ 



GO 



cnr 



I 



ON 



CI S O ON "O t-- T-i VO O t> CJ 



ON ON 

vo o\ 



in 

ON O 

«5 ^ 
r-i oo 

oo 

r-i «S 



V10\0N00O00TtO>0000t^<S 

^--ooSo^^ mv>«r)ooooosONONoqT-< 



CO 



w oo 



m ^ oo ^ ^ 



oo On to CO v5 
— vb vo t<i vo ON »n 



i. 

o 

CO 

1 

a 




o 




I. 



11 

fill 



s 
s 



ST 




t** in CO o m <s 

1^ Xf -O0-f<|- OO W .tH 



o CD o W 

? fO ^ S >2 «^ «9 



I 

a 

O 
CO 



oo 

II 



s 

<1 



V) o\ «s vo 25 S S? 

vo (S oo fn oo oe e\ 

o\ so vo iH oo rr 

So §1 «^ vj o 



ON . 



s 



o oo 05 W £2 

«s a\ o c<i ^ vj 22 

«3 ^ ^ 

ON ?n yn 52 5::: a 

^ ri S2 

ON ^ t;- jn en 

tH <^ eo en 



m »r) en 5 f2 

e-^ tn 00 vo 

fn 00 th 00 en o 

in t** 00 00 cn in 

t> ON en oo t-; ov 

ON th T-i en 

^ ^ 15 5:: S £2 J2 

iH iH f<< en en 



-a 
o 

m 



^^^^ 



v-ri'C'S' isri ^ =55 "%: 
p:>, ei^ r--H ^ 

nr» r-Q r-<^ r>^ T/Q 
f5 rH tH ir-C 



=5 
1^ 



i 

CD 
CO 



00 

i 

o 

CO 

• »— « 

i 

i 

A. 

o 



§ 

ST 




CO 



o m o cai o 



tn m 



Tt <s 

00 vj ?5 2 
T-< <s r<i ^ to 



s 



■13 

S 



« ? ? ^ ?i ^ S 



vo .On . ON 1-H - 52 gv fl- Si 



I 



t^fnr^ tr- oo QC ^ ^ CO. 



<^ 00 oo vo oo 




This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 
JZTbLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□fuNES OR MARKS ON ORIGINAL DOCUMENT 

□ R£FERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



4 



